Search CORE

21 research outputs found

Inferring Haplotypes of Copy Number Variations From High-Throughput Data With Uncertainty

Author: Hosono Naoya
Kato Mamoru
Leotta Anthony
Sebat Jonathan
Tsunoda Tatsuhiko
Yoon Seungtai
Zhang Michael Q.
Publication venue: Genetics Society of America
Publication date
Field of study

Accurate information on haplotypes and diplotypes (haplotype pairs) is required for population-genetic analyses; however, microarrays do not provide data on a haplotype or diplotype at a copy number variation (CNV) locus; they only provide data on the total number of copies over a diplotype or an unphased sequence genotype (e.g., AAB, unlike AB of single nucleotide polymorphism). Moreover, such copy numbers or genotypes are often incorrectly determined when microarray signal intensities derived from different copy numbers or genotypes are not clearly separated due to noise. Here we report an algorithm to infer CNV haplotypes and individuals’ diplotypes at multiple loci from noisy microarray data, utilizing the probability that a signal intensity may be derived from different underlying copy numbers or genotypes. Performing simulation studies based on known diplotypes and an error model obtained from real microarray data, we demonstrate that this probabilistic approach succeeds in accurate inference (error rate: 1–2%) from noisy data, whereas previous deterministic approaches failed (error rate: 12–18%). Applying this algorithm to real microarray data, we estimated haplotype frequencies and diplotypes in 1486 CNV regions for 100 individuals. Our algorithm will facilitate accurate population-genetic analyses and powerful disease association studies of CNVs

Crossref

PubMed Central

Big Data Pipelines on the Computing Continuum: Tapping the Dark Data

Author: Elvesæter Brian
Kharlamov Evgeny
Kimovski Dragi
Ledakis Giannis
Leotta Francesco
Marrella Andrea
Matskin Mihhail
Nikolov Nikolay
Prodan Radu
Roman Dumitru
Simonet-Boulogne Anthony
Song Hui
Soylu Ahmet
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2022
Field of study

The computing continuum enables new opportunities for managing big data pipelines concerning efficient management of heterogeneous and untrustworthy resources. We discuss the big data pipelines lifecycle on the computing continuum and its associated challenges, and we outline a future research agenda in this area.acceptedVersio

SINTEF Open

DataCloud: Enabling the Big Data Pipelines on the Computing Continuum

Author: Benvenuti Dario
Ceccarelli Raffaele
Elvesæter Brian
Kharlamov Evgeny
Kimovski Dragi
Ledakis Giannis
Leotta Francesco
Marrella Andrea
Matskin Mihhail
Nikolov Nikolay
Perales Fernando
Prodan Radu
Roman Dumitru
Simonet-Boulogne Anthony
Solberg Arnor
Soylu Ahmet
Ulisses Alexandre
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2021
Field of study

acceptedVersio

SINTEF Open

Big Data Pipelines on the Computing Continuum: Ecosystem and Use Cases Overview

Author: Ceccarelli Raffaele
Elvesæter Brian
Kharlamov Evgeny
Kimovski Dragi
Ledakis Giannis
Leotta Francesco
Marrella Andrea
Matskin Mihhail
Nikolov Nikolay
Perales Fernando
Prodan Radu
Roman Dumitru
Simonet-Boulogne Anthony
Solberg Arnor
Song Hui
Soylu Ahmet
Theodosiou Konstantinos
Ulisses Alexandre
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2021
Field of study

Organisations possess and continuously generate huge amounts of static and stream data, especially with the proliferation of Internet of Things technologies. Collected but unused data, i.e., Dark Data, mean loss in value creation potential. In this respect, the concept of Computing Continuum extends the traditional more centralised Cloud Computing paradigm with Fog and Edge Computing in order to ensure low latency pre-processing and filtering close to the data sources. However, there are still major challenges to be addressed, in particular related to management of various phases of Big Data processing on the Computing Continuum. In this paper, we set forth an ecosystem for Big Data pipelines in the Computing Continuum and introduce five relevant real-life example use cases in the context of the proposed ecosystem.acceptedVersio

SINTEF Open

ZENODO

NORA - Norwegian Open Research Archives

Microduplications of 16p11.2 are associated with schizophrenia

Recurrent microdeletions and microduplications of a 600 kb genomic region of chromosome 16p11.2 have been implicated in childhood-onset developmental disorders1-3. Here we report the strong association of 16p11.2 microduplications with schizophrenia in two large cohorts. In the primary sample, the microduplication was detected in 12/1906 (0.63%) cases and 1/3971 (0.03%) controls (P=1.2×10-5, OR=25.8). In the replication sample, the microduplication was detected in 9/2645 (0.34%) cases and 1/2420 (0.04%) controls (P=0.022, OR=8.3). For the series combined, microduplication of 16p11.2 was associated with 14.5-fold increased risk of schizophrenia (95% C.I. [3.3, 62]). A meta-analysis of multiple psychiatric disorders showed a significant association of the microduplication with schizophrenia, bipolar disorder and autism. The reciprocal microdeletion was associated only with autism and developmental disorders. Analysis of patient clinical data showed that head circumference was significantly larger in patients with the microdeletion compared with patients with the microduplication (P = 0.0007). Our results suggest that the microduplication of 16p11.2 confers substantial risk for schizophrenia and other psychiatric disorders, whereas the reciprocal microdeletion is associated with contrasting clinical features

Carolina Digital Repository

Microduplications of 16p11.2 are associated with schizophrenia

Author: Addington Anjene M.
Bhandari Abhishek
Chitkara Nisha
Christian Susan L.
Cichon Sven
Craddock Nick
Crow Timothy J.
DeLisi Lynn E.
DeRosse Pamela
Deutsch Curtis K.
Dickel Diane E.
Gallagher Louise
Ganesh Jaya
Gary Sydney
Gill Michael
Goodell Meredith
Grozeva Detelina
Haldeman-Englert Chad
Iakoucheva Lilia M
Kaplan Paige
Kassem Layla
Kendall Jude
King Mary-Claire
Kirov George
Krantz Ian D.
Krastoshevsky Olga
Krause Verena
Kumar Ravinesh A.
Kusenda Mary
Kustanovich Vlad
Lajonchere Clara M.
Lakshmi B.
Lee Yoon-ha
Lehtimäki Terho
Leibenluft Ellen
Leotta Anthony
Levy Deborah L.
Lieberman Jeffrey A.
Makarov Vladimir
Malhotra Anil K.
Malhotra Dheeraj
McCarthy Shane E.
McClellan Jon
McMahon Francis J.
Mendell Nancy R.
Nöthen Markus M.
Owen Michael J.
O’Donovan Michael C.
Pavon Kevin
Pearl Justin
Perkins Diana
Potash James B.
Puura Kaija
Rapoport Judith
Rietschel Marcella
Roccanova Patricia
Schulze Thomas G.
Sebat Jonathan
Shaikh Tamim H.
Skuse David
Spinner Nancy B.
Steele Jo
Stroup T. Scott
Sullivan Patrick
Susser Ezra
Sutcliffe James S.
Vacic Vladimir
Walsh Tom
Wellcome Trust Case Control Consortium
Willour Virginia L.
Wolff Jessica
Yoon Seungtai
Zackai Elaine H.
Publication venue
Publication date: 01/01/2009
Field of study

Recurrent microdeletions and microduplications of a 600-kb genomic region of chromosome 16p11.2 have been implicated in childhood-onset developmental disorders1,2,3. We report the association of 16p11.2 microduplications with schizophrenia in two large cohorts. The microduplication was detected in 12/1,906 (0.63%) cases and 1/3,971 (0.03%) controls (P = 1.2 × 10−5, OR = 25.8) from the initial cohort, and in 9/2,645 (0.34%) cases and 1/2,420 (0.04%) controls (P = 0.022, OR = 8.3) of the replication cohort. The 16p11.2 microduplication was associated with a 14.5-fold increased risk of schizophrenia (95% CI (3.3, 62)) in the combined sample. A meta-analysis of datasets for multiple psychiatric disorders showed a significant association of the microduplication with schizophrenia (P = 4.8 × 10−7), bipolar disorder (P = 0.017) and autism (P = 1.9 × 10−7). In contrast, the reciprocal microdeletion was associated only with autism and developmental disorders (P = 2.3 × 10−13). Head circumference was larger in patients with the microdeletion than in patients with the microduplication (P = 0.0007)

Carolina Digital Repository

Mouse genomic representational oligonucleotide microarray analysis: Detection of copy number variations in normal and tumor specimens

Author: Alexander Joan
Egan Christopher
Hall Ira M.
Healy John
Lakshmi B.
Leotta Anthony
Lowe Scott W.
Lucito Robert
Spector Mona S.
Wigler Michael
Xue Wen
Zender Lars
Publication venue: National Academy of Sciences
Publication date: 14/07/2006
Field of study

Genomic amplifications and deletions, the consequence of somatic variation, are a hallmark of human cancer. Such variation has also been observed between “normal” individuals, as well as in individuals with congenital disorders. Thus, copy number measurement is likely to be an important tool for the analysis of genetic variation, genetic disease, and cancer. We developed representational oligonucleotide microarray analysis, a high-resolution comparative genomic hybridization methodology, with this aim in mind, and reported its use in the study of humans. Here we report the development of a representational oligonucleotide microarray analysis microarray for the genomic analysis of the mouse, an important model system for many genetic diseases and cancer. This microarray was designed based on the sequence assembly MM3, and contains ≈84,000 probes randomly distributed throughout the mouse genome. We demonstrate the use of this array to identify copy number changes in mouse cancers, as well to determine copy number variation between inbred strains of mice. Because restriction endonuclease digestion of genomic DNA is an integral component of our method, differences due to polymorphisms at the restriction enzyme cleavage sites are also observed between strains, and these can be useful to follow the inheritance of loci between crosses of different strains

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

DataCloud: Enabling the Big Data Pipelines on the Computing Continuum

Author: Benvenuti Dario
Ceccarelli Raffaele
Elvesæter Brian
Kharlamov Evgeny
Kimovski Dragi
Ledakis Giannis
Leotta Francesco
Marrella Andrea
Matskin Mihhail
Nikolov Nikolay
Perales Fernando
Prodan Radu
Roman Dumitru
Simonet-Boulogne Anthony
Solberg Arnor
Soylu Ahmet
Ulisses Alexandre
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2021
Field of study

With the recent developments of Internet of Things (IoT) and cloud-based technologies, massive amounts of data are generated by heterogeneous sources and stored through dedicated cloud solutions. Often organizations generate much more data than they are able to interpret, and current Cloud Computing technologies cannot fully meet the requirements of the Big Data processing applications and their data transfer overheads. Many data are stored for compliance purposes only but not used and turned into value, thus becoming Dark Data, which are not only an untapped value, but also pose a risk for organizations

SINTEF Open

NORA - Norwegian Open Research Archives

DataCloud: Enabling the Big Data Pipelines on the Computing Continuum

Author: Benvenuti Dario
Ceccarelli Raffaele
Elvesæter Brian
Kharlamov Evgeny
Kimovski Dragi
Ledakis Giannis
Leotta Francesco
Marrella Andrea
Matskin Mihhail
Nikolov Nikolay
Perales Fernando
Prodan Radu
Roman Dumitru
Simonet-Boulogne Anthony
Solberg Arnor
Soylu Ahmet
Ulisses Alexandre
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2021
Field of study

NORA - Norwegian Open Research Archives